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Abstract. 

By combining the 2MASS and DENIS infrared surveys with the 
USNO-B proper-motion catalog, it will be possible to map the structure 
of the Galaxy in unprecedented detail. The key parameter that these 
surveys measure together, but neither measures separately, is G = r^jv^, 
the time it takes a star moving at transverse speed v± to cross its own 
radius r*. This parameter cleanly separates white dwarfs, main sequence 
stars, and giants, and simultaneously separates disk from spheroid stars. 
The infrared photometry then yields photometric parallaxes for the indi¬ 
vidual stars, while the proper motions give kinematic information. Hence, 
it becomes possible to measure the physical and velocity distributions of 
the known Galactic components and to identify new ones. The analysis 
of such a huge data set poses a major technical challenge. I offer some 
initial ideas on how to meet this challenge. 


1. Introduction 

50 years from now, our catalogs will probably list position, color, and velocity 
information for 10^® stars, a large fraction of all the stars in the Galaxy and 
about 5 orders of magnitude more than today. But it is not clear how much 
more we will know about Galactic structure than we do today. For example, 
how much more would we know about the air in this room if we catalogued the 
position and velocity of its 10^^ molecules, than we do just from the temperature, 
pressure, and composition? It is appropriate to contemplate the construction of 
such a gigantic star catalogue because within 2 or 3 years we will be half way 
there (logarithmically). It is not at all obvious how we can make use of the 
forthcoming vast quantities of data to learn about the structure of our Galaxy. 

Traditionally, the Galaxy is broken down into components, and as more data 
are acquired, more components are added: disk, thick disk, spheroid, bulge/bar, 
spiral arms (e.g. Bahcall 1986). Each component is described by a few param¬ 
eters, e.g., scale height, scale length, and luminosity function for the disk. The 
study of Galactic structure is basically one of identifying these components and 
measuring their parameters with increasing precision. 

We might at first imagine taking all of our new data on the one hand, 
and all the global parameters describing the various Galactic components on 
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the other, and dumping them both into a huge maximum likelihood program 
that will in turn spit out a detailed description of the Galaxy. However, such 
an approach would overwhelm even the most optimistic estimate of near-term 
computing advances. 

2. Individual-Star Parameter Counting 

From the standpoint of Galactic structure, the three most important quantities 
to learn about each star are its distance, d, and its type, characterized by its 
radius, r*, and its temperature, T. In order to learn anything about these 
quantities from photometry, one must simultaneously measure the extinction. 
Ay. The two infrared surveys, 2MASS and DENIS, will measure fluxes in only 
three bands, JHK and UK respectively. Simple parameter counting tells us we 
cannot determine four quantities from three observables. In fact, we can extract 
only Ay, T, and 0* = r*/d, the angular radius of the star. The deviation of the 
two observed colors from the black-body two-color relation yields Ay, and the 
dereddened color gives T. From the dereddened color and flux plus the Planck 
law one then knows 0*. A white dwarf at 1 pc, a subdwarf at 25 pc, a disk dwarf 
at 40 pc, and a disk giant at 500 pc will all look approximately the same to an 
infrared survey if they have the same temperature. One way out might be to 
get a fourth color, but in fact since stars are approximately black bodies, the 
fourth color can already be predicted from the other three. 

Thus, even to isolate the individual Galactic components, let alone measure 
them, requires more information than we are going to get from an infrared sur¬ 
vey. Fortunately, before these surveys are completed, the USNO-B astrometric 
catalog should be available. 

3. Proper Motions 

From this catalog (combined with an additional epoch from 2MASS/DENIS), 
one should be able to achieve an astrometric precision of 

cr^ ~ 3masyr“^ ~ 14kms~^ kpc“^ (North), (1) 

and perhaps a factor 2 worse in the South (D. Monet 1998, private communica¬ 
tion) 

Photometry plus proper motions together yield a very curious quantity, 

v±_ v_\_/d /i 

the time it takes a star to cross its own radius. Why is this quantity of any inter¬ 
est? Because, as Table Q shows, it separates out very nicely different populations 
of stars. 

Of course, any individual star will still be ambiguous at some level. How¬ 
ever, in an analysis of millions of stars, all that is important is that most of the 
stars in a given component are so identified with ^ 70% probability. Maximum 
likelihood can then easily sort out the components statistically. Since individual 
components are separated in by a factor 3 to 5, this condition is well satisfied. 
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Table 1. Stellar Radius Crossing Times 


Star Type 

r/r© 

u©/(kms ^) 

f*/min 

Halo White Dwarf 

0.02 

200 

1 

Disk White Dwarf 

0.02 

40 

6 

Halo Sub-Dwarf 

0.5 

200 

30 

Disk Dwarf 

0.8 

40 

230 

Halo Gaint 

10 

200 

600 

Disk Gaint 

10 

40 

3000 


Virtually all of the stars detected by 2MASS have counterparts on the POSS 
E (R band) plates in the North or the equivalent ESO plates in the South, so 
proper motions should be available for all of them. 


4. The Near Zone 

At 1 kpc, a disk star has a proper motion ~ 8 mas yr which should be detectable 
at least in the North. Main-sequence disk stars down to mid-G {Mk < 4) should 
be detectable in K at the 10 ct level at this distance. There are about 10^ such 
stars. There are also about 10® detectable thick-disk star and 10^ detectable 
halo subdwarfs. The proper motions of these latter two classes of stars would 
be detectable at much larger distances (2 kpc and 5 kpc respectively). However, 
the fraction of main-sequence stars that are detectable photometrically drops 
rapidly past 1 kpc. Nevertheless, of order 10^ halo giants will be detectable in 
both the photometric and proper motion surveys. 

What sort of questions can be asked about the stars in this zone? First, of 
course, one can measure the parameters of the known components much more 
precisely than they have been measured before. For example, the disk is usually 
parameterized by a scale height h ~ 200 pc, and a scale length of H r\j 3 kpc, 
while the thick disk has h ~ 600 pc and ~ 3 kpc (Gould, Bahcall, & Flynn 
1997). The spheroid is parameterized by a power-law density fall-off ~ —3.1 
and a flattening cja = 0.8 (Gould, Flynn, & Bahcall 1998). 

However, with the vast new quantities of data that will soon be available, 
we should be able to ask many other questions about Galactic structure. What 
are the velocity distributions of the various components? Traditionally, one has 
tried to measure only the first and second moments, but higher moments will also 
be accessible. Gould &: Popowski (1998) found, for example, that the kurtoses of 
the halo as determined from ~ 150 RR Lyrae stars are ~ 2, 3 and 4 in the three 
principal directions, compared to 3 for a Gaussian and 1.8 for a sharp-edged box 
distribution. This is undoubtedly telling us something about the origin of the 
stellar halo, but what? Much more precise and detailed information about all 
three components will be obtained from the IR and proper motion surveys. Are 
the scale height and scale length of the disk and thick disk functions of spectral 
type? Are there additional components? For example, Sommer-Larsen &: Zhen 
(1990) and Gould et ah (1998), each some found evidence for a two-component 
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halo, one roughly round and the other highly flattened (but not rotating). The 
high vertical kurtosis found by Gould &: Popowski (1998) could also be regarded 
as evidence for this. Or it is possible, that our current disk/thick disk dichotomy 
should be replaced by a continuum of components, as suggested by Norris &: 
Ryan (1991). 

5. Complementarity of SDSS 

If there are a continuum of components, they will be difficult to distinguish from 
discrete components using IR-survey plus proper-motion data alone. The argu¬ 
ment codified in Table |I] was that discrete components could be separated based 
on their radius crossing times, t*. However, this assumed that the components 
are separated by a factor two in t*. 

To distinguish finer graduations requires a metallicity-sensitive indicator. 
This will not be available from 2MASS and DENIS, but will be from the Sloan 
Digital Sky Survey (SDSS), which has 5-color photometry out to the atmospheric 
cutoff. Unfortunately, in its present incarnation, SDSS will saturate at 15th 
magnitude, implying very little overlap with AT < 14 surveys (and then only 
for stars with poor IR photometry.) Moreover, SDSS will cover only regions far 
from the plane. 

However, from the standpoint of the statistical separation of components, 
it is not necessary that there be SDSS and IR photometry for the same stars. 
It is only necessary that the SDSS stars be close enough that they have proper 
motions reliable to better than a factor 2. The transverse motions of thick-disk 
stars are ~ lOOkms”^, implying factor 2 proper motions errors at a distance 
~ 3kpc. At this distance, the entire main sequence is unsaturated. 


6. Mass Distribution of Disk 

Traditionally, measurements of the local mass density have relied on stellar radial 
velocity and density measurements in a cone perpendicular to the Galactic plane, 
(e.g. Bahcall 1984; Kuijken &: Gilmore 1989, 1991; Bahcall, Flynn &: Gould 1992; 
Flynn & Fuchs 1994). Greze et al. (1998) pioneered a radically different type 
of survey based on Hipparcos parallaxes and proper motions of ~ 3000 nearby 
stars. This study was by far the most sensitive to mass close to the Galactic 
plane. They found that the local mass density is almost completely accounted for 
by visible material. However, because of the Hipparcos magnitude limit (V < 8) 
they were able to probe only within 100 pc of the Sun. Moreover, since they 
were mainly restricted to bright (and hence young) A and F stars, it is possible 
that their sample was not dynamically mixed and hence subject to systematic 
errors. 

IR surveys (plus proper motions) will provide a much more robust mea¬ 
surement of the density close to the plane, first because the sample will be three 
orders of magnitude larger than the Hipparcos sample, and second because the 
stars will be primarily G and later and thus older and more dynamically mixed 
and so less subject to systematic errors. 

Of course, unlike the Hipparcos stars, these stars will not have trigonometric 
parallaxes. One will be forced to rely on photometric parallaxes. Will the 
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inevitable distance errors, including systematic errors, compromise the result. 
Remarkably, no. The density, /?, is given by Poisson’s equation 


1 dK^ 
AttG dz ’ 


( 3 ) 


where Kz{z) is the vertical acceleration as a function of height 2 ; above the plane. 
For simplicity, I consider an isothermal population of stars, but the argument I 
am about to give applies equally well to any velocity distribution. Then 


din v 


( 4 ) 


where and v^z) are the vertical velocity dispersion and vertical density pro¬ 
file of some stellar population. Now, suppose that there is a survey close to the 
plane (e.g. |6| < 10°), and suppose that all photometric distances are systemat¬ 
ically overestimated by a factor a relative to the true distances. Then z ^ az 
will be overestimated from the measured angular position, and Vz avz will 
be overestimated from the measured proper motion, {y —> a~^u will be under¬ 
estimated, but this has no effect, because only the derivative of Inu enters eq. 
^). Hence, from equation (^) Kz will be overestimated by /a = a, so that 
from equation p will be properly estimated despite the errors. Of course, Vz 
cannot be directly estimated from the proper motion and the distance. Rather 
Vz = fJ-bd sec b + U|| tan b, where n|| is the component of motion parallel to the 

Galactic plane. That is, = {p^dYsec^b — tan^ 6. To obtain v‘1, one must 

therefore independently estimate Uy and subtract it. For |6| < 10°, tan^ b ^ 0.03, 
so the systematic error in this correction is negligible. It should be possible to 
obtain excellent results for 2 ; < 170 pc (assuming sensitivity to stars at 1 kpc). 
For higher b, the systematic errors will become progressively worse. However, 
it will be possible to map the mass density with much greater precision than 
previously, especially close to the plane. 


7. The Far Zone 

Paczyhski & Stanek (1998) have recently suggested /-band luminosities of clump 
giants (metal rich core-helium burning stars) as standard candles. Whether 
clump giants eventually turn out to have the same /-band luminosities inde¬ 
pendent of age, metallicity, and other environmental factors remains open to 
question. However, the bulge clump giants can reasonably be expected to have 
a common set of environmental characteristics and therefore act at least as rela¬ 
tive standard candles for the bulge. That is, they may or may not tell us about 
the galactocentric distance, but they can certainly trace the density structure 
of the bulge. With Kq ~ 13 and Iq ~ 14, they will be visible in the bulge in 
all regions with Ay < 9. They can be dereddened in the standard way. Hence, 
they will provide a three dimensional map of the bulge. The typical proper 
motions (from internal dispersion) are only ~ 3masyr“^. Since these are the 
same order as the errors, it may be possible to obtain statistical information on 
the velocity dispersion as a function of position, although systematics will be a 
major concern. 
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Spiral arms contain large numbers of early type stars, probably of common 
metallicity, which should provide good relative distances. However, they will 
yield almost no proper-motion information. 

It will also be possible to map the three dimensional distribution of dust 
using early type stars and clump giants. 

8. Likelihood 

We can imagine, then, maximizing the likelihood of a given model represented by 
say 100 parameters, given the observation of say 400,000,000 data values. The 
parameters being disk scale height, scale length, luminosity function, thick disk 
parameters, spheroid parameters, velocity ellipsoids, kurtoses, bar parameters, 
spiral arm densities and pitch angles, etc. The data are JHK photometry 
and proper motions for the stars, plus other colors when available from various 
sources. In principle, one could just dump all this in a giant black-box likelihood 
program and wait for it to spit out the answer. A big job, perhaps, but one can 
always hope that computers will improve sufficiently by then. Good luck! 

In fact, new computational methods will be required. The evaluation of 
the likelihood function requires the prediction of the probability of hundreds of 
millions of observations for a huge ensemble of models spanning a vast parameter 
space. Here I outline one possible approach based on the one used by Gould et 
al. (1998). 

Consider a bin in observation space J, H, K, a, 5, The probability of 

observing n stars in this bin is 

where r is the expected density of stars times the volume of the parameter-space 
bin. Then, by definition, L = Y\p Pp where (3 = ( J, H, K, a, 5, fia, Hence, 

InL = ^nplnTp-^Tfs -^Innpl ( 6 ) 

0 0 0 

In the limit of very small bins n ^ 0,1, which implies that Inn^! = 0. Hence, 

lnL= ^ Inr^-Aexp (7) 

/3,detections 

where Aexp is the total number of detections predicted by the model. 

We can now write Tp as a vector product of two types of quantities, one 
of which depends on the choice of model parameters, and the other of which is 
independent. For example, assuming disk model parameters scale height h, scale 
length H, and A'-band luminosity function with 1-mag averaged bin densities 
one can write, 


Ti,j,k,p 


l,m 


( 8 ) 
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Here i,j,k,p,q are observational bins, i = {H — K)q, j = Kq, k = {a, 6), 
and p = The model-dependent density is given by h'k,mih,H) as a 

function of distance modulus m, position vector k, and model parameters h 
and H. The model-dependent observed proper motion distribution, 
is given as a function of k, m, and proper-motion vector p, together with the 
model parameters vi,V 2 ,V 3 bulk velocity and (Ti,cj 2 , 0 ' 3 , dispersions. However, 
the color-mag diagram, which requires a cumbersome numerical integration over 
observational errors, is given once and for all. 

Similarly, one can write A^exp as 


A^exp ^ ^ ^f)cmdtot,fc,i,m) 

k,l,m 


(9) 


where 


CIXldtot,fc,/,m 


Ecmdij,fc,/,™. 


( 10 ) 


Again cmdtot,fe,«,m) which involves an enormous numerical integration, needs to 
be calculated only once. Equation (p^) (and its derivatives with respect to all 
parameters) can be calculated quite easily. If one were to really try to evaluate 
equation (|^ for each of 10^ stars for each iteration, it would still be a cum¬ 
bersome project. Just storing the matrix elements for the model-independent 
function might prove unwieldy. However, likelihood space can be 

searched on progressively larger subsets of the data, say first using 10^ stars to 
locate the approximate solution, and then increasing the sample to get more 
precise results. The last iterations would be time consuming, but there would 
be few of them. 


9. Conclusions 

The completion of the 2MASS and DENIS surveys, together with the USNO-B 
astrometric catalog will revolutionize our knowledge of Galactic structure. The 
astrometric input is required not only to obtain kinematic information, but also 
to separate out the various components of the Galaxy, which otherwise would 
be almost totally degenerate. The computational facilities required to interpret 
these data sets appear at first sight daunting. I have tried to give some initial 
suggestions on how the analysis can be simplified. 
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